Design and Evaluation of Storage Organizations for Read-Optimized Main Memory Databases

Authors

  • Craig Chasseur
  • Jignesh M. Patel
Abstract

Existing main memory data processing systems employ a variety of storage organizations and make a number of storage-related design choices. The focus of this paper is on systematically evaluating a number of these key storage design choices for main memory analytical (i.e., read-optimized) database settings. Our evaluation produces a number of key insights. First, it is always beneficial to organize data into self-contained memory blocks rather than large files. Second, both column-stores and row-stores display performance advantages for different types of queries, and for high performance both should be implemented as options for the tuple-storage layout. Third, cache-sensitive B+-tree indices can play a major role in accelerating query performance, especially when used in a block-oriented organization. Finally, compression can also play a role in accelerating query performance, depending on data distribution and query selectivity. Source code: https://www.cs.wisc.edu/quickstep
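
As a rough illustration of the first two insights, the following is a minimal Python sketch of self-contained blocks that can use either a row-store or a column-store tuple layout behind the same interface. It is not taken from the paper or from the Quickstep source code; the names RowBlock, ColumnBlock, and load, and the capacity parameter, are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, List, Sequence, Tuple


@dataclass
class RowBlock:
    """Hypothetical fixed-capacity block that keeps each tuple together."""
    capacity: int
    rows: List[Tuple[Any, ...]] = field(default_factory=list)

    def insert(self, row: Sequence[Any]) -> bool:
        if len(self.rows) >= self.capacity:
            return False  # block is full; caller must allocate a new one
        self.rows.append(tuple(row))
        return True

    def scan(self, col: int, pred: Callable[[Any], bool]) -> List[Tuple[Any, ...]]:
        # Every tuple is visited in full, even though only one attribute is tested.
        return [r for r in self.rows if pred(r[col])]


@dataclass
class ColumnBlock:
    """Hypothetical fixed-capacity block that keeps each attribute in its own array."""
    capacity: int
    num_columns: int
    columns: List[List[Any]] = field(init=False)

    def __post_init__(self) -> None:
        self.columns = [[] for _ in range(self.num_columns)]

    def insert(self, row: Sequence[Any]) -> bool:
        if len(row) != self.num_columns:
            raise ValueError("row width does not match block schema")
        if len(self.columns[0]) >= self.capacity:
            return False
        for column, value in zip(self.columns, row):
            column.append(value)
        return True

    def scan(self, col: int, pred: Callable[[Any], bool]) -> List[Tuple[Any, ...]]:
        # Only the predicate column is read first; matching tuples are rebuilt afterwards.
        matches = [i for i, v in enumerate(self.columns[col]) if pred(v)]
        return [tuple(column[i] for column in self.columns) for i in matches]


def load(rows, make_block):
    """Spread rows over as many blocks as needed, allocating a new block when one fills."""
    blocks = [make_block()]
    for row in rows:
        if not blocks[-1].insert(row):
            blocks.append(make_block())
            blocks[-1].insert(row)
    return blocks


rows = [(i, i * 10) for i in range(5)]
row_blocks = load(rows, lambda: RowBlock(capacity=2))
col_blocks = load(rows, lambda: ColumnBlock(capacity=2, num_columns=2))

# The same selection runs block by block, regardless of the layout inside each block.
row_hits = [t for b in row_blocks for t in b.scan(1, lambda v: v >= 20)]
col_hits = [t for b in col_blocks for t in b.scan(1, lambda v: v >= 20)]
```

Because each block exposes the same insert and scan operations, the layout can be chosen per block or per table without changing the query code. The sketch says nothing about the relative performance of the two layouts, which is what the paper evaluates.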


Similar resources

Design and Evaluation of Storage Organizations for Read-Optimized Main Memory Databases: Supplemental Material

This document is a supplement to the authors’ paper Design and Evaluation of Storage Organizations for Read-Optimized Main Memory Databases [4]. It contains additional experimental results which demonstrate the interaction between join processing and storage organization.


High-Performance Concurrency Control Mechanisms for Main-Memory Databases

A database system optimized for in-memory storage can support much higher transaction rates than current systems. However, standard concurrency control methods used today do not scale to the high transaction rates achievable by such systems. In this paper we introduce two efficient concurrency control methods specifically designed for main-memory databases. Both use multiversioning to isolate r...


Trekking Through Siberia: Managing Cold Data in a Memory-Optimized Database

Main memories are becoming sufficiently large that most OLTP databases can be stored entirely in main memory, but this may not be the best solution. OLTP workloads typically exhibit skewed access patterns where some records are hot (frequently accessed) but many records are cold (infrequently or never accessed). It is still more economical to store the coldest records on secondary storage such ...


Revisiting Database Storage Optimizations on Flash

The database storage hierarchy has been heavily optimized for the performance characteristics of disks. Storage managers typically employ row- or column-oriented storage layouts, or a combination, to improve the I/O performance of different query workloads with disks. The recent rise of flash memory-based solid-state drives (SSDs) significantly changes the performance characteristics of storage: t...


Aggregates Caching in Columnar In-Memory Databases

The mixed database workloads found in enterprise applications are comprised of short-running transactional as well as analytical queries with resource-intensive data aggregations. In this context, caching the query results of long-running queries is desirable as it increases the overall performance. However, traditional caching approaches are inefficient in a way that changes in the base data r...



Journal title:
  • PVLDB

Volume 6  Issue 

Pages  -

Publication date 2013